CDS

Accession Number TCMCG011C23646
gbkey CDS
Protein Id XP_021909211.1
Location complement(join(1874847..1875068,1875221..1875352,1875556..1875786,1876308..1876385,1876499..1876564,1876677..1876757,1877229..1877992,1878944..1879140,1879457..1879551,1880367..1880507,1880821..1880898,1881165..1881293,1881448..1881579,1882401..1882520,1882630..1882681,1890962..1891093,1891785..1891837,1892032..1892127,1892960..1893105,1897360..1897495,1897687..1897840,1898481..1898627,1898862..1899026,1899028..1899182))
Gene LOC110823188
GeneID 110823188
Organism Carica papaya

Protein

Length 1234aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA264084
db_source XM_022053519.1
Definition LOW QUALITY PROTEIN: cleavage and polyadenylation specificity factor subunit 1 [Carica papaya]

EGGNOG-MAPPER Annotation

COG_category A
Description Cleavage and polyadenylation specificity factor
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000        [VIEW IN KEGG]
ko00001        [VIEW IN KEGG]
ko03019        [VIEW IN KEGG]
ko03021        [VIEW IN KEGG]
KEGG_ko ko:K14401        [VIEW IN KEGG]
EC -
KEGG_Pathway ko03015        [VIEW IN KEGG]
map03015        [VIEW IN KEGG]
GOs GO:0003674        [VIEW IN EMBL-EBI]
GO:0003676        [VIEW IN EMBL-EBI]
GO:0003723        [VIEW IN EMBL-EBI]
GO:0003729        [VIEW IN EMBL-EBI]
GO:0005488        [VIEW IN EMBL-EBI]
GO:0005575        [VIEW IN EMBL-EBI]
GO:0005622        [VIEW IN EMBL-EBI]
GO:0005623        [VIEW IN EMBL-EBI]
GO:0005634        [VIEW IN EMBL-EBI]
GO:0005737        [VIEW IN EMBL-EBI]
GO:0005829        [VIEW IN EMBL-EBI]
GO:0006139        [VIEW IN EMBL-EBI]
GO:0006378        [VIEW IN EMBL-EBI]
GO:0006379        [VIEW IN EMBL-EBI]
GO:0006396        [VIEW IN EMBL-EBI]
GO:0006397        [VIEW IN EMBL-EBI]
GO:0006725        [VIEW IN EMBL-EBI]
GO:0006807        [VIEW IN EMBL-EBI]
GO:0008150        [VIEW IN EMBL-EBI]
GO:0008152        [VIEW IN EMBL-EBI]
GO:0009987        [VIEW IN EMBL-EBI]
GO:0010467        [VIEW IN EMBL-EBI]
GO:0016070        [VIEW IN EMBL-EBI]
GO:0016071        [VIEW IN EMBL-EBI]
GO:0031123        [VIEW IN EMBL-EBI]
GO:0031124        [VIEW IN EMBL-EBI]
GO:0034641        [VIEW IN EMBL-EBI]
GO:0043170        [VIEW IN EMBL-EBI]
GO:0043226        [VIEW IN EMBL-EBI]
GO:0043227        [VIEW IN EMBL-EBI]
GO:0043229        [VIEW IN EMBL-EBI]
GO:0043231        [VIEW IN EMBL-EBI]
GO:0043631        [VIEW IN EMBL-EBI]
GO:0044237        [VIEW IN EMBL-EBI]
GO:0044238        [VIEW IN EMBL-EBI]
GO:0044424        [VIEW IN EMBL-EBI]
GO:0044444        [VIEW IN EMBL-EBI]
GO:0044464        [VIEW IN EMBL-EBI]
GO:0046483        [VIEW IN EMBL-EBI]
GO:0071704        [VIEW IN EMBL-EBI]
GO:0090304        [VIEW IN EMBL-EBI]
GO:0090305        [VIEW IN EMBL-EBI]
GO:0090501        [VIEW IN EMBL-EBI]
GO:0097159        [VIEW IN EMBL-EBI]
GO:1901360        [VIEW IN EMBL-EBI]
GO:1901363        [VIEW IN EMBL-EBI]

Sequence

CDS:  
ATGAGTTTCGCAGCCTATAAGATGATGCACTGGCCCACCGGCATCGAGAACTGCGCGTCCGGCTACATTACCCACTGCCGCGCGGACTTCACGCCACAAATCCCGTTAATGCAGACTGACGATCTCGACTCCGAGTGTCATCCAAGCGCGGTATTTCCCGTTCCTAACCTAGTCGTAACCGCAGCTAACGTCCTCGAAATTTACGTGGTTAGGGTGCAGGAAGAAGGCAGCCGAGACTCCAGGAATCCTGCAGAGATGAAACGCGGTGGGGTGATGGATGGTGTTTCAGGGGCCTCTCTCGAGCTTGTTTGCCACAACAGATTGCACGGTAACGTTGTGTCAATGGTTGTACTATCTGTAGGAGGTGGTGATGGTTCTAGGAGAAGAGATTCTATTATCTTGGCCTTTCAAGACGCAAAAATTTCTGTCCTGGAGTTCGACGATTCTATTCATGGCCTTCGTACAAGCTCTATGCATTGTTTTGAGGGTCCAGAATGGATACATTTGAAAAGAGGCAGAGAATCCTTTGCAAGAGGCCCATTGGTAAAGGTGGATCCACAAGGCAGGTGTGGAGGCGTTCTTGTTTATGATTTGCAAATGATAATACTTAAGGCTGCCCAGGCTGGTTCTGGTTTGGTTGGAGATGACGATCCTTTTGGTTCTGGAGGAGCAATTTCTGCTCGTATTGAGTCATCTTACATAATCAATTTACGGGATTTGGAAATGAAGCATGTAAAGGATTTCATATTTGTGCATGGTTATATTGAACCTGTGATGGTCATTCTTCATGAGAGGGAGCTTACTTGGGCTGGTCGTGTTGCATGGAAGCACCATACTTGCATGGTTTCTGCACTTAGTATCAGCACAACTTTAAAACAACACCCCCTAATATGGTCTGCTGCTAATCTTCCACAAGATGCTTACAAACTACTTGCAGTTCCATCTCCAATTGGTGGTGTTCTTGTGATAAGTGCCAATTCAATTCACTATCATAGTCAGTCTGCTTCTTGTGCATTGGCTTTGAACAGTTTTGCTGTTTCAATTGATGGCAGTCAAGACCTGCCAAGATCAAGTTTCAGTGTGGAACTTGATGCTGCCAATGCTACATGGTTACTAAATGATGTAGCTTTGCTATCAACAAAGATGGGGGAATTGCTACTGCTAACACTAATTTATGATGGGCGGGTTGTGCAGAGGCTTGATCTTTCTAAGTCAAAAGCTTCAGTTCTAACTTCGGATATTACAACAATAGGAAATTCCTTGTTTTTTCTGGGTAGCCGCTTGGGTGACAGTTTGCTTGTGCAATTTACTCGTGGCTCAGGCACCTCAATGATGTCTTCCAGTCTGAAGGATGAGAATGGAGACATTGAAGGTGATGGTCCTTTATCAAAACGTTTGCGGAGGTCATCTTCTGATGCTTTGCAAGATATGATTGGTGAAGAGCTATCCTTGTATGGTTCAGCCCCAAATAATTCCGAGTCAGCACAGAAGACTTTCTCATTTGCAGTAAGAGATTCATTTGTTAACATCGGTCCATTGAAAGACTTCGCTTATGGCCTGAGGATTAATGCTGATCCAAATGCAACTGGAGTTGCTAAACAAAGTAACTATGAACTGGTCTGCTGTTCTGGTCATGGGAAGAATGGTGCTCTCTGCATTCTTCGCCAGTCAATTCGCCCAGAAATGATTACTGAGGTTGAACTTCCTGGTTGCAAGGGAATGTGGACAGTTTACCACAAGAACACACGTGGTCATAATGCTGATTCTTCTAAAATGGCTGCAGAGGAGGATGAGTATCATGCTTATTTGATTATCAGCTTAGAGGCTCGGACCATGGTACTTGAAACAGCTGATCTTTTAACAGAAGTTACTGAAAGTGTCGACTATTATGTCCAAGGAAGAACAATTGCTGCTGGGAATTTGTTTGGAAGGCGTCGTGTTATCCAGGTGTTTGAGCATGGTGCTCGGGTTCTTGATGGTTCTTTTATGACTCAGGATTTAAGCTTTGGAGCTTCTAATTCTGATAGTAATACCAGTTCGGAAAACTCCACAGTGTCCTCTGTTTCAATTGCTGATCCCTATGCGTTACTGAAAATGATTGATGGAAGCCTCAGGCTGCTTGTTGGAGATCCTTCTACTTGCACCATTTCTATAAATACACTAGCTGCCCTTGAGGGCTCAAACAAATTAGTGTCAGCGTGCGCTTTGTACCATGATAAAGGACCAGAGCCATGGTTACGCAAGGCCAGTACTGATGCATGGCTTTCTACTGGCATCAGTGAGGCCATTGATGGTGCTGATTGCGGCCCTCATGACCAGGGTGACATATACTGCATTGTTTGTTATGAAAGTGGTGCCCTTGAAATATTTGATGTGCCAAATTTCAACTGTGTTTTCTCTGTGAATAAATTCATGTGTGGAAGAAGTCACCTTCTTGATAGAAATACCCAAGATCCTTGCAAGGATTTACAGAGAGGAACTAGCAATATCTCTGAAGAAGTCAATGCCTCTGGCAGGAAAGAAACTATGCAAAATATGAAGGTTGTTGAATTAGCCATGCAGAGATGGTCTGGACAGCATAGTCGCCCTTTTCTGTTTGGAATATTGACTGATGGGACAATTCTTTGTTACCATGCTTACCTATTTGAGGGTTCAGACAATGCTTCTAAAGTTGAGGACTCTGTTTCCATGACTTCTAGTTCTAGGCTTAAGAATTTGAGATTTGTTCGCATTCCCTTGGATACATACACTAGGGAGGAGATGTCAAATGAGACTGCATGCCAAAGGTTTACCATGTTCAAGAATATTAGTGGTCATCAAGGGTTTTTCCTTTCTGGGTCAAGACCAGTTTGGTGCATGGTCTTCAGGGAACGGCTCCGATTTCATCCACAGCTATGTGATGGATCTATTGCGGCATTCACAGTTCTTCACAATGTGAACTGTAATCACGGGTTCATCTATGTTACATCACAGGGCATTCTGAAGATTTGTCAATTGCCATCCATATCAAACTATGATAATTATTGGCCTGTGCAAAAGGTTCCTTTGAAAGGCACTCCACATCAAGTCACCTACTTCGCTGACAAGAACTTGTATCCGCTTATAGTTTCAGTTCCGGTGAACAAGCCGCTGAATCAAGTTCTTTCATCGCTGGTTGATCAAGAAGCTGGTCAGCAGATTGATAGTCATAATTTGAGCTCTGATGAACTTCATCGCACATACAGTATTGACGAATTTGAGGTTCGGCTCTTGGAACCCGAAAAATCTGGTGGCGTTTGGGAAACTAAAGCTACAATTCCTATGCAAAGCTCAGAAAATGCACTGACTGTGAGAGTGGTAACACTACTTAATACCATCACAAAGGAGAATGAAACCCTGTTGGCTATTGGAACTGCTTATGTGCAAGGAGAGGATGTTGCAGCAAGGGGACGCGTAATTTTGTTTTCAGTTGGAAGAAATAATGATAATACTCAAATTTCGGTTGCAGAAGTTTACTCGAAGGAATTGAAGGGTGCTATATCTGCTTTAGCCTCAATTCAAGGCCATCTACTGATAGCATCTGGTCCAAAAATTATTTTGCACAAGTGGAATGGCACTGAATTGAATGGTGTTGCATTTTTTGATGCTCCACCATTATATGTTGTGAGCCTGAATATTGTAAGTGTTTTCTTTTCCAGTTTAATTAAAGTGTTACTTTCTTAG
Protein:  
MSFAAYKMMHWPTGIENCASGYITHCRADFTPQIPLMQTDDLDSECHPSAVLXPVPNLVVTAANVLEIYVVRVQEEGSRDSRNPAEMKRGGVMDGVSGASLELVCHNRLHGNVVSMVVLSVGGGDGSRRRDSIILAFQDAKISVLEFDDSIHGLRTSSMHCFEGPEWIHLKRGRESFARGPLVKVDPQGRCGGVLVYDLQMIILKAAQAGSGLVGDDDPFGSGGAISARIESSYIINLRDLEMKHVKDFIFVHGYIEPVMVILHERELTWAGRVAWKHHTCMVSALSISTTLKQHPLIWSAANLPQDAYKLLAVPSPIGGVLVISANSIHYHSQSASCALALNSFAVSIDGSQDLPRSSFSVELDAANATWLLNDVALLSTKMGELLLLTLIYDGRVVQRLDLSKSKASVLTSDITTIGNSLFFLGSRLGDSLLVQFTRGSGTSMMSSSLKDENGDIEGDGPLSKRLRRSSSDALQDMIGEELSLYGSAPNNSESAQKTFSFAVRDSFVNIGPLKDFAYGLRINADPNATGVAKQSNYELVCCSGHGKNGALCILRQSIRPEMITEVELPGCKGMWTVYHKNTRGHNADSSKMAAEEDEYHAYLIISLEARTMVLETADLLTEVTESVDYYVQGRTIAAGNLFGRRRVIQVFEHGARVLDGSFMTQDLSFGASNSDSNTSSENSTVSSVSIADPYALLKMIDGSLRLLVGDPSTCTISINTLAALEGSNKLVSACALYHDKGPEPWLRKASTDAWLSTGISEAIDGADCGPHDQGDIYCIVCYESGALEIFDVPNFNCVFSVNKFMCGRSHLLDRNTQDPCKDLQRGTSNISEEVNASGRKETMQNMKVVELAMQRWSGQHSRPFLFGILTDGTILCYHAYLFEGSDNASKVEDSVSMTSSSRLKNLRFVRIPLDTYTREEMSNETACQRFTMFKNISGHQGFFLSGSRPVWCMVFRERLRFHPQLCDGSIAAFTVLHNVNCNHGFIYVTSQGILKICQLPSISNYDNYWPVQKVPLKGTPHQVTYFADKNLYPLIVSVPVNKPLNQVLSSLVDQEAGQQIDSHNLSSDELHRTYSIDEFEVRLLEPEKSGGVWETKATIPMQSSENALTVRVVTLLNTITKENETLLAIGTAYVQGEDVAARGRVILFSVGRNNDNTQISVAEVYSKELKGAISALASIQGHLLIASGPKIILHKWNGTELNGVAFFDAPPLYVVSLNIVSVFFSSLIKVLLS